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1 TACTAAAGGG AACAAAAGCT GGAGCTCCAC CGCGGTGGCG GCCGCTCTAG AACTAGTGGA 
ATGATTTCCC TTGTTTTCGA CCTCGAGGTG GCGCCACCGC CGGCGAGATC TTGATCACCT 

5 1 UTR 



TCCCCCGGGC TGCAGGAATT CGGCACGAGG AACTTTCTGC CTCGTTTTTT TGCTCCTACT 
AGGGGGCCCG ACGTCCTTAA GCCGTGCTCC TTGAAAGACG GAGCAAAAAA ACGAGGATGA 
5' UTR SEQ ID NO: 3 



M S 

GTTTTTCTCT TCCAGTTTCT ACCATGTCGC 
CAAAAAGAGA AGGTCAAAGA TGGTACAGCG 
SEQ ID NO: 3 



IITN AKS GTV 
TCATCACTAA CGCCAAATCC GGCACAGTTG 
AGTAGTGATT GCGGTTTAGG CCGTGTCAAC 

IIGF PKH GGT 
TTATTGGATT TCCCAAGCAT GGAGGAACAA 
AATAACCTAA AGGGTTCGTA CCTCCTTGTT 



QEIV QSG QTY 
AAGAAATTGT TCAATCAGGA CAAACCTACA 
TTCTTTAACA AGTTAGTCCT GTTTGGATGT 



VDLS GED NKS 
TTGACCTTTC GGGCGAAGAC AACAAATCTA 
AACT GG AAAG CCCGCTTCTG TTGTTTAGAT 

NQRW TLN WTG 
ATCAGAGGTG GACCCTCAAC TGGACAGGGA 
TAGTCTCCAC CTGGGAGTTG ACCTGTCCCT 
SEQ ID NO: 5 



KSWT FRS VSS EMYL GLN GSP 
AGAGTTGGAC TTTCCGCTCC GTTTCTTCTG AAATGTATCT TGGCCTGAAT GGCTCGCCGT 
TCTCAACCTG AAAGGCGAGG CAAAGAAGAC TTTACATAGA ACCGGACTTA CCGAGCGGCA 

SEQ ID NO: 4 (partial) 



SEQ ID NO: 5 



SEQ ID NO: 6 (partial) 



S D G T KLV A V T TPVE WRI WH 
CTGATGGAAC AAAACTGGTA GCCGTGACCA CCCCTGTTGA GTGGCGCATC TGGCACGA 418 
GACTACCTTG TTTTGACCAT CGGCACTGGT GGGGACAACT CACCGCGTAG ACCGTGCT 
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5' UTR 



START 

MS Q E I 

1 GCCTCGTTTT TTTGCTCCTA CTGTTTTTCT CTTCCAGTTT CTACCATGTC GCAAGAAATT 

CGGAGCAAAA AAACGAGGAT GACAAAAAGA GAAGGTCAAA GATGGTACAG CGTTCTTTAA 

VQS GQTY IIT NAK SGTV VDL 
61 GTTCAATCAG G ACAAACC T A CATCATCACT AACGCCAAAT CCGGCACAGT TGTTGACCTT 

CAAGTTAGTC CTGTTTGGAT GTAGTAGTGA TTGCGGTTTA GGCCGTGTCA ACAACTGGAA 

SGE DNKS IIG FPK HGGT NQR 
121 TCGGGCGAAG ACAACAAATC TATTATTGGA TTTCCCAAGC ATGGAGGAAC AAATCAGAGG 

AGCCCGCTTC TGTTGTTTAG ATAATAACCT AAAGGGTTCG TACCTCCTTG TTTAGTCTCC 

WTL NWTG KSW 
181 TGGACCCTCA ACTGGACAGG GAAGAGTTGG A 211 
ACCTGGGAGT TGACCTGTCC CTTCTCAACC T 
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VDL SGE DNK SIIG F P K HGG 
1 TTGTTGACCT TTCGGGCGAA GACAACAAAT CTATTATTGG ATTTCCCAAG CATGGAGGAA 
AACAACTGGA AAGCCCGCTT CTGTTGTTTA GATAATAACC TAAAGGGTTC GTACCTCCTT 

TNQR WTL NWT GKSW TFR SVS 
61 CAAATCAGAG GTGGACCCTC AACTGGACAG GGAAGAGTTG GACTTTCCGC TCCGTTTCTT 
GTTTAGTCTC CACCTGGGAG TTGACCTGTC CCTTCTCAAC CTGAAAGGCG AGGCAAAGAA 

SEMY LGL NGS PSDG TKL VAV 
121 CTGAAATGTA TCTTGGCCTG AATGGCTCGC CGTCTGATGG AACAAAACTG GTAGCCGTGA 
GACTTTACAT AGAACCGGAC TTACCGAGCG GCAGACTACC TTGTTTTGAC CATCGGCACT 

TTPV EWH IWH DEVD PST YRI 
181 CCACCCCTGT TGAGTGGCAC ATCTGGCACG ACGAAGTTGA CCCTTCAACT TATCGTATCT 
GGTGGGGACA ACTCACCGTG TAGACCGTGC TGCTTCAACT GGGAAGTTGA ATAGCATAGA 

A/G polymorphism 

FVPF TTF NMD LYAQ GSA APG 
241 TTGTACCTTT CACCACATTC AACATGGACC TCTACGCCCA RGGTAGTGCC GCCCCTGGTA 
AACATGGAAA GTGGTGTAAG TTGTACCTGG AGATGCGGGT YCCATCACGG CGGGGACCAT 

T/C polymorphism 

TPIT TWY TWK GIHQ TWR FEL 
301 CGCCTATCAC AACTTGGTAT ACATGGAAGG GYATCCACCA AACGTGGAGG TTTGAACTAG 
GCGGATAGTG TTGAACCATA TGTACCTTCC CRTAGGTGGT TTGCACCTCC AAACTTGATC 

T/G polymorphism 

STOP 

3' UTR 



A * 

361 CTTAGGKTCA GGTTTCGGAT GTAATTTGTG TGTGTAAATC TTCTTGGACC ATGTTGTGCT 
GAATCCMAGT CCAAAGCCTA CATTAAACAC ACACATTTAG AAGAACCTGG TACAACACGA 

3' UTR 



4 21 TTTATTGTAC TCCGCTTGTT 
AAATAACATG AGGCGAACAA 
PolyA tail 



3' UTR 

481 CAAAAAAAAA AAA 493 
GTTTTTTTTT TTT 



ATCATTATAC CCACCTATGT 
TAGTAATATG GGTGGATACA 



TGCAACATCT TTTTGGATCC 
ACGTTGTAGA AAAACCTAGG 
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START 

M SQE IVQ SGQT YII 

1 TCTCTTCCAG TTTCTACCAT GTCGCAAGAA ATTGTTCAAT CAGGACAAAC CTACATCATC 

AGAGAAGGTC AAAGATGGTA CAGCGTTCTT TAACAAGTTA GTCCTGTTTG GAT G TAG TAG 

T N A KSGT VVD LSG EDNK SII 
61 ACTAACGCCA AATCCGGCAC AGTTGTTGAC CTTTCGGGCG AAGACAACAA ATCTATTATT 
TGATTGCGGT TTAGGCCGTG TCAACAACTG GAAAGCCCGC TTCTGTTGTT TAGATAATAA 

G F P KHGG TNQ RWT LNWT GKS 
121 GGATTTCCCA AGCATGGAGG AACAAATCAG AGGTGGACCC TCAACTGGAC AGGGAAGAGT 
CCTAAAGGGT TCGTACCTCC TTGTTTAGTC TCCACCTGGG AGTTGACCTG TCCCTTCTCA 

WTF RSVS SEM YLG LNGS PSD 
181 TGGACTTTCC GCTCCGTTTC TTCTGAAATG TATCTTGGCC TGAATGGCTC GCCGTCTGAT 
ACCTGAAAGG CGAGGCAAAG AAGACTTTAC ATAGAACCGG ACTTACCGAG CGGCAGACTA 

GTK LVAV TTP VEW HIWH DEV 
241 GGAACAAAAC TGGTAGCCGT GACCACCCCT GTTGAGTGGC ACATCTGGCA CGACGAAGTT 
CCTTGTTTTG ACCATCGGCA CTGGTGGGGA CAACTCACCG TGTAGACCGT GCTGCTTCAA 

DPS TYRI FVP FTT FNMD LYA 
301 GACCCTTCAA CTTATCGTAT CTTTGTACCT TTCACCACAT TCAACATGGA CCTCTACGCC 
CTGGGAAGTT GAATAGCATA GAAACATGGA AAGTGGTGTA AGTTGTACCT GGAGATGCGG 

A/G polymorphism C/T polymorphi 

QGS AAPG TPI TTW YTWK GIH 
361 CAAGGTAGTG CCGCCCCTGG TACGCCTATC ACAACTTGGT ATACATGGAA GGGCATCCAC 
GTTCCATCAC GGCGGGGACC ATGCGGATAG TGTTGAACCA TATGTACCTT CCCGTAGGTG 

G/T polymorphism 

STOP 

QTW REEL A* 
4 21 CAAACGTGGA GGTTTGAACT AGCTTAGGGT CAGGTTTCGG ATGTAATTTG T 4 91 
GTTTGCACCT CCAAACTTGA TCGAATCCCA GTCCAAAGCC TACATTAAAC A 
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FIG. 6 

START 

exon 1 



M SQE IVQ SGQT YII 
1 TCTCTTCCAG TTTCTACCAT GTCGCAAGAA ATTGTTCAAT CAGGACAAAC CTACATCATC 
AGAGAAGGTC AAAGATGGTA CAGCGTTCTT TAACAAGTTA GTCCTGTTTG GATGTAGTAG 

exon 1 



intron 1 



T N A KSGT VVD LSG EDNK S 
61 ACTAACGCCA AATCGGGCAC AGTTGTTGAC CTTTCGGGCG AAGACAACAA ATCTAGTAAG 
TGATTGCGGT TTAGGCCGTG TCAACAACTG GAAAGCCCGC TTCTGTTGTT TAGATCATTC 

intron 1 



121 TCGTTTTTAG TCCCATGTTT TTTTTTGTCA AAAAAAATTG ACTGACATAT TTTGTCTCCA 
AGCAAAAATC AGGGTACAAA AAAAAACAGT TTTTTTTAAC TGACTGTATA AAACAGAGGT 

exon 2 



intron 2 



IG FPKH GGT NQR 
181 GTTATTGGAT TTCCCAAGCA TGGAGGAACA AATCAGAGGG TAGGTCTAGA AATGCACCTC 
CAATAACCTA AAGGGTTCGT ACCTCCTTGT TTAGTCTCCC ATCCAGATCT TTACGTGGAG 

exon 3 



intron 2 



WT LNW TGKS 
241 GTTAATATTG GTTTTTATTG ACATTCATGA ACAGTGGACC CTCAACTGGA CAGGGAAGAG 
CAATTATAAC CAAAAATAAC TGTAAGTACT TGTCACCTGG GAGTTGACCT GTCCCTTCTC 

exon 3 



WTF RSV SSEM YLG LNG SPSD 

301 TTGGACTTTC CGCTCCGTTT CTTCTGAAAT GTATCTTGGC CTGAATGGCT CGCCGTCTGA 

AACCTGAAAG GCGAGGCAAA GAAGACTTTA CATAGAACCG GACTTACCGA GCGGCAGACT 

exon 3 



GTK LVA VTTP VEW HIW HDEV 
■361 TGGAACAAAA CTGGTAGCCG TGACCACCCC TGTTGAGTGG CACATCTGGC ACGACGAAGT 
ACCTTGTTTT GACCATCGGC ACTGGTGGGG ACAACTCACC GTGTAGACCG TGCTGCTTCA 
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exon 3 intron 3 



DPS T Y 

4 21 TGACCCTTCA ACTTATCGGT GAGTCCCCTA AATATTACTT GCTTGTGGTT CATACTAATA 
ACTGGGAAGT TGAATAGCCA CTCAGGGGAT TTATAATGAA CGAACACCAA GTATGATTAT 

intron 3 exon 4 



IF VPFT TFN MDL YAQG 
481 CGTCGTTCGA AGTATCTTTG TACCTTTCAC CACATTCAAC ATGGACCTCT ACGCCCAGGG 
GCAGCAAGCT TCATAGAAAC ATGGAAAGTG GTGTAAGTTG TACCTGGAGA TGCGGGTCCC 

exon 4 



SAA PGT PITT WYT WKG IHQT 
541 TAGTGCCGCC CCTGGTACGC CTATCACAAC TTGGTATACA TGGAAGGGTA TCCACCAAAC 
ATCACGGCGG GGACCATGCG GATAGTGTTG AACCATATGT ACCTTCCCAT AGGTGGTTTG 

intron 4 



exon 4 



W R F EL 

601 GTGGAGGTTT GAACTAGGTA GGGCTTGCGA TCTCACCCGG ATCCTCCATG AACTAATGTG 
CACCTCCAAA CTTGATCCAT CCCGAACGCT AGAGTGGGCC TAGGAGGTAC TTGATTACAC 

intron 4 STOP 



661 ATCACGTCGT GTTCTAGCTT AGGTTCAGGT TTCGGATGTA ATTTGT 706 
TAGTGCAGCA CAAGATCGAA TCCAAGTCCA AAGCCTACAT TAAACA 
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